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Abstract — We investigate the relation between the girth and 
the guaranteed error correction capability of 7-left regular LDPC 
codes when decoded using the bit flipping (serial and parallel) 
algorithms. A lower bound on the number of variable nodes 
which expand by a factor of at least 37/4 is found based on 
the Moore bound. An upper bound on the guaranteed correction 
capability is established by studying the sizes of smallest possible 
trapping sets. 

I. Introduction 

Iterative algorithms for decoding low-density parity-check 
(LDPC) codes have been the focus of research over the past 
decade and most of their properties are well understood [1], 
[2]. These algorithms operate by passing messages along 
the edges of a graphical representation of the code known 
as the Tanner graph, and are optimal when the underlying 
graph is a tree. Message passing decoders perform remarkably 
well which can be attributed to their ability to correct errors 
beyond the traditional bounded distance decoding capability. 
However, in contrast to bounded distance decoders (BDDs), 
the guaranteed error correction capability of iterative decoders 
is largely unknown. 

The problem of recovering from a fixed number of erasures 
is solved for iterative decoding on the binary erasure channel 
(BEC). If the Tanner graph of a code does not contain any 
stopping sets [3] up to size t (or equivalently the size of 
minimum stopping set is t + 1), then the decoder is guaranteed 
to recover from any t erasures. Orlitsky et al. in [4] studied the 
relation between stopping sets and girth and derived bounds on 
the smallest stopping set in any d-left regular bipartite graph 
and girth g. 

An analogous result is unknown for decoding on other 
channels such as the binary symmetric channel (BSC) and 
the additive white Gaussian noise (AWGN) channel. In this 
paper, we present a step toward such result for hard decision 
decoding algorithms. Gallager [5] proposed two binary mes- 
sage passing algorithms, namely Gallager A and Gallager B, 
for decoding over the BSC. He showed that for the column- 
weight 7 > 3 and p > 7, there exist (n,j,p) 1 regular low- 
density parity-check (LDPC) codes for which the bit error 
probability asymptotically tends to zero whenever we operate 
below the threshold. The minimum distance was shown to 

'Precise definitions will be given in Section II 



increase linearly with the code length, but correction of a 
linear fraction of errors was not shown. Zyablov and Pinsker 
[6] analyzed LDPC codes under a simpler decoding algorithm 
known as the bit flipping algorithm and showed that almost 
all the codes in the regular ensemble with 7 > 5 can 
correct a constant fraction of worst case errors. Sipser and 
Spielman in [7] used expander graph arguments to analyze 
two bit flipping algorithms, serial and parallel. Specifically, 
they showed that these algorithms can correct a fraction of 
errors if the underlying Tanner graph is a good expander. 
Burshtein and Miller in [8] applied expander based arguments 
to show that message passing algorithms can also correct a 
fixed fraction of worst case errors when the degree of each 
variable node is more than five. Feldman et al. [9] showed 
that the linear programming decoder [10] is also capable of 
correcting a fraction of errors. Recently, Burshtein in [11] 
showed that regular codes with variable nodes of degree four 
are capable of correcting a linear number of errors under bit 
flipping algorithm. He also showed tremendous improvement 
in the fraction of correctable errors when the variable node 
degree is at least five. 

It is well known that a random graph is a good expander 
with high probability [7]. However, the fraction of nodes 
having the required expansion is very small and hence the 
code length to guarantee correction of a fixed number of errors 
must be large. Moreover, determining the expansion of a given 
graph is known to be NP hard [12], and spectral gap methods 
cannot guarantee an expansion factor of more than 1/2 [7]. On 
the other hand, code parameters such as column weight and 
girth can be easily determined or are assumed to be known for 
the code under consideration. The approach in this paper is to 
determine the size of variable node sets in a left regular LDPC 
code which are guaranteed to have the expansion required by 
bit flipping algorithms based on Moore bound [13, p. 180]. 
The consequence of our results is that the error correction 
capability grows exponentially in girth. However, we note that 
since the girth grows logarithmically in the code length, this 
result does not show that the bit flipping algorithms can correct 
a linear fraction of errors (the proof and discussion are beyond 
the scope of this paper). 

To find an upper bound on the number of correctable errors, 
we study the size of sets of variable nodes which lead to 
decoding failures. A decoding failure is said to have occurred 



if the output of the decoder is not equal to the transmitted 
codeword [14]. The conditions that lead to decoding failures 
are well understood for a variety of decoding algorithms such 
as maximum likelihood decoding, bounded distance decoding 
and iterative decoding on the binary erasure channel. However, 
for iterative decoding on the BSC and AWGN channel, the 
understanding is far from complete. Two approaches have been 
taken in this direction, namely trapping sets [14] and pseudo- 
codewords [15]. We adopt the trapping set approach in this 
paper to characterize decoding failures. Richardson introduced 
the notion of trapping sets in [14] to estimate the error floor 
on the AWGN channel. In [16], trapping sets were used to 
estimate the frame error rate of column-weigh-three LDPC 
codes. In this paper, we define trapping sets with the help of 
fixed points for the bit flipping algorithms (both serial and 
parallel). We then find bounds on the size of trapping sets 
based on extremal graphs known as cage graphs [17], thereby 
finding an upper bound on the guaranteed error correction 
capability. 

The rest of the paper is organized as follows. In Section 
II, we provide a brief introduction to LDPC codes, decoding 
algorithms and trapping sets [14]. In Section III, we prove 
our main theorem relating the column weight and girth to the 
number of variable nodes which expand by a factor of at least 
37/4. We derive bounds on the size of trapping sets in Section 
IV, and conclude with a few remarks in Section V. 

II. Preliminaries 

In this section, we first establish the notation and then 
proceed to give a brief introduction to LDPC codes and bit 
flipping algorithms. We then give the relation between the error 
correction capability of the code and the expansion of the 
underlying Tanner graph. We finally describe trapping sets for 
the bit flipping algorithms. 

A. Graph Theory Notation 

We adopt the standard notation in graph theory (see [18] 
for example). G — (U, E) denotes a graph with set of nodes 
U and set of edges E. When there is no ambiguity, we simply 
denote the graph by G. An edge e is an unordered pair (u\, 112) 
of nodes and is said to be incident on ui and u 2 . Two nodes 
ui and U2 are said to be adjacent (neighbors) if there is an 
edge e = (ui, 112) incident on them. The order of the graph is 
\U\ and the size of the graph is \E\. The degree of u, d(u), 
is the number of its neighbors. A node with degree one is 
called a leaf or a pendant node. A graph is d-regular if all 
the nodes have degree d. The average degree d of a graph is 
defined d&d=2\E\/\U\.Vas girth g(G) of a graph G, is the 
length of smallest cycle in G. H = [V U C, E') denotes a 
bipartite graph with two sets of nodes; variable (left) nodes V 
and check (right) nodes C and edge set E' . Nodes in V have 
neighbors only in C and vice versa. A bipartite graph is said 
to be 7-left regular if all variable nodes have degree 7, p-right 
regular if all check nodes have degree p and (7,p) regular 
if all variable nodes have degree 7 and all check nodes have 
degree p. The girth of a bipartite graph is even. 



B. LDPC Codes and Decoding Algorithms 

LDPC codes [5] are a class of linear block codes which can 
be defined by sparse bipartite graphs [19]. Let G be a bipartite 
graph with two sets of nodes: n variable nodes and m check 
nodes. This graph defines a linear block code C of length n and 
dimension at least n — m in the following way: The n variable 
nodes are associated to the n coordinates of codewords. A 
vector v = (v\, V2, ■ ■ ■ , v n ) is a codeword if and only if for 
each check node, the modulo two sum of its neighbors is zero. 
Such a graphical representation of an LDPC code is called 
the Tanner graph [20] of the code. The adjacency matrix of G 
gives a parity check matrix of C. An (n, 7, p) regular LDPC 
code has a Tanner graph with n variable nodes each of degree 
7 (column weight) and nj/p check nodes each of degree p 
(row weight). This code has length n and rate r > 1 — 7/p 
[19]. 

We now describe the parallel bit flipping algorithm [6], [7] 
to decode LDPC codes. As noted earlier, each check node 
imposes a constraint on the neighboring variable nodes. A 
constraint (check node) is said to be satisfied by a setting of 
variable nodes if the sum of the variable nodes in the constraint 
is even; otherwise the constraint is unsatisfied. 

Parallel Bit Flipping Algorithm 

• In parallel, flip each variable that is in more unsatisfied 
than satisfied constraints. 

• Repeat until no such variable remains. 

A serial version of the algorithm is also defined in [7] and all 
the results in this paper hold for the serial bit flipping algorithm 
also. The bit flipping algorithms are iterative in nature but do 
not belong to the class of message passing algorithms. 

C. Expansion and Error Correction Capability 

Sipser and Spielman [7] analyzed the performance of the 
bit flipping algorithms using the expansion properties of the 
underlying Tanner graph of the code. We summarize the results 
from [7] below for the sake of completeness. We start with 
the following definitions from [7]. 

Definition 1: Let G = (U,E) with \U\ = n\. We say that 
every set of at most mi nodes expands by a factor of 5 if, for 
all sets S C U 

\S\ <mi4- \{y :3xeS such that (x,y) 6 E}\ > S\S\. 
We consider bipartite graphs and expansion of variable nodes 
only. 

Definition 2: A graph is a (7, p, a, S) expander if it is a 
(7, p) regular bipartite graph in which every subset of at most 
a fraction of the variable nodes expands by a factor of at least 
5. 

The following theorem from [7] relates the expansion and error 
correction capability of an (n, 7, p) LDPC code with Tanner 
graph G when decoded using the parallel bit flipping decoding 
algorithm. 

Theorem 1: [7, Theorem 11] Let G be a (7, p, a, (3/4 + 
6)7) expander over n variable nodes, for any e > 0. Then, 
the simple parallel decoding algorithm will correct any a < 



a(l + 4e)/2 fraction of errors after log 1 _ 4e (aon) decoding 

rounds. 

Notes: 

1) The serial bit flipping algorithm can also correct ao < 
a/2 fraction of errors if G is a (7, p, a, (3/4)7) ex- 
pander. 

2) The results hold for any left regular code as we need 
expansion of variable nodes only. 

From the above discussion, we observe that finding the number 
of variable nodes which are guaranteed to expand by a factor 
of at least 37/4, gives a lower bound on the guaranteed error 
correction capability of LDPC codes. 

D. Decoding Failures and Trapping Sets 

We now define decoding failures of the bit flipping algo- 
rithms and characterize these failures using trapping sets. 

Consider an LDPC code of length n and let x be the binary 
vector which is the input to the hard decision decoder. Let 
iS(x) be the support of x. The support of x is defined as the 
set of all positions i where x(z) 7^ 0. The set of variable nodes 
(bits) which differ from their original value are referred to as 
corrupt variables. 

Definition 3: x is a fixed point of the bit flipping algorithm 
if the set of corrupt variables remains unchanged after one 
round of decoding. 

Definition 4: [16] Let x be a fixed point. Then <S(x) is 
known as a trapping set. An (a, b) trapping set T is a set of 
a variable nodes whose induced subgraph has b odd degree 
checks. 

The size of T denoted by |T| is the number of variable nodes 
in T. From the definitions, it is clear that if the input to the 
decoder is a fixed point then the output is also the same fixed 
point. Or in other words, if the initial errors are in variable 
nodes corresponding to a trapping set, then the decoder will 
not converge to the original codeword. Hence, bounding the 
size of trapping sets gives an upper bound on the guaranteed 
error correction capability. Apart from fixed point decoding 
failures, there exist other types of failures, the discussion of 
which is beyond the scope of this paper. 

III. Expansion, Column Weight and Girth 

In this section, we prove our main theorem which relates 
the column weight and girth of a code to its error correction 
capability. We show that the size of variable node sets which 
have the required expansion is related to the well known 
Moore bound [13, p. 180]. We start with a few definitions 
required to establish the main theorem. 

A. Definitions 

Definition 5: The reduced graph H r = (V U C ri E' r ) of 
H = (V U C, E') is a graph with vertex set V UC r and edge 
set E' r given by 

C r = C \ C p , C p = {c G C : c is a pendant node} 

E' r = E'\ E' p , E' p = {(v h Cj) EE:c 3 e C p } 



Definition 6: Let H = (V U C, E') be such that Vu G 
V,d(v) < 7. The 7 augmented graph H 7 = (V U C 7 ,E') 
is a graph with vertex set V U C 7 and edge set E' given by 

\v\ 

C 7 = C U C a , where C a = (J and 

i=l 

Ca = { c li • ■ • ! c 7 -d(t)i)}- 

\V\ 

E\ = E'UE' U , where E' a = \J < 4 and 

i=l 

_ E'J - {(v uCj )eVxC a : Cj eC l a }. 

Definition 7: [7, Definition 4] The edge-vertex incidence 
graph G ev = (U U E,E ev ) of G = (U,E) is the bipartite 
graph with vertex set U U E and edge set 

E ev = {(e, u) G E x U : u is an endpoint of e}. 

Notes: 

1) The edge-vertex incidence graph is right regular with 
degree two. 

2) \E ev \ = 2\E\. 

3) g(G ev ) - 2. 9 (G). 

Definition 8: An inverse edge-vertex incidence graph 
H iev = {V, E' iev ) of H = (V U C, E') is a graph with vertex 
set V and edge set E' iev which is obtained as follows. For 
c e C r , let N(c) denote the set of neighbors of c. Label one 
node Vi G N(c) as a root node. Then 

EL = {{vi,Vj)€VxV :v i eN{c),v j &N{c), 
i ^ j, Vi is a root node, for some c G C r }. 

Notes: 

1) Given a graph, the inverse edge-vertex incidence graph 
is not unique. 

2) g(H iev ) > g(H)/2, \E> ev \ = \E' r \ - \C r \ and \C r \ < 
\E' r \/2 

3) \E' iev \ > \E' r \/2 with equality only if all checks in C r 
have degree two. 

4) The term inverse edge-vertex incidence is used for the 
following reason. Suppose all checks in H have degree 
two. Then the edge-vertex incidence graph of H iev is 
H. 

The Moore bound [13, p. 180] denoted by no(d, g) is a lower 
bound on the least number of vertices in a d-regular graph with 
girth g. It is given by 

n {d,g) = n (d, 2r + 1) = 1 + d^(d - If, g odd 

»=o 

r-l 

n Q (d,g) = n (d,2r) = 2^(d - l)\g even 

i=o 

In [21], it was shown that a similar bound holds for irregular 
graphs. 

Theorem 2: [21] The number of nodes n(d,g) in a graph 
of girth g and average degree at least d > 2 satisfies: 

n(d,g) > n (d,g) 
Note that d need not be an integer in the above theorem. 



B. The Main Theorem 

We now state and prove the main theorem. 

Theorem 3: Let C be an LDPC code with 7-left regular 
Tanner graph G. Let g(G) = 2g'. Then for all k < n (7/2, g'), 
any set of k variable nodes expands by a factor of at least 37/4. 

Proof: Let G k = {V k UC k ,E k ) denote the subgraph 
induced by a set of k variable nodes V k . Since G is 7-left 
regular, \E k \ = jk. Let G k = {V k U C k , E k ) be the reduced 
graph. We have 



and therefore 



\C k \ = 



C k 
E k p 

E k \ = \C k 



\E 



\C k p \ = lk-\E k \ 

We need to prove that \C k \ > i^k/A. 

Let f(k,g') denote the maximum number of edges in an 
arbitrary graph of order k and girth g' . By Theorem 2, for all 
k < 710(7/2, g'), the average degree of a graph with k nodes 
and girth g 1 is less than 7/2. Hence, f(k,g') < jk/4. We 
now have the following lemma. 

Lemma 1: The number of edges in G k cannot exceed 
2f(k,g')i.e., 

\E k \<2f(k,g'). 
Proof: The proof is by contradiction. Assume that \E k | > 
2f(k, g'). Consider G k ev — (V k ,E k ev ), an inverse edge vertex 
incidence graph of G k . We have 

\Et v \>f(k,g'). 

This is a contradiction as G k iv is a graph of order k and girth 
at least g' . ■ 
We now find a lower bound on \C k \ in terms of f(k,g'). We 
have the following lemma. 

Lemma 2: \C k \ > - f(k, g'). 

Proof: Let \E k \ = 2f(k,g') — j for some integer j > 
0. Then \E k \ = 7/c - 2f(k,g') + j. We claim that \C k \ > 
f(k,g') + j. To see this, we note that 



\c. 



k 

lev 
fe 



= \E k \ - \C k \, or 



\E k \ 



\El 



But 



I^Ll < fM) 

^\C k \ > 2f(k,g')-j-f(k,g') 
^\C*\ > f(k,g')-j 

Hence we have, 

\c k \ = \c k \ + \c k \ 

^>\C k \ > f(k,g')-j + 1 k-2f(k,g')+j 
^\C k \ > jk-f(k,g'). 

The theorem now follows as 

f{k,g')< 1 k/A 



\C k \ > 3 7 /c/4 



Corollary 1: Let C be an LDPC code with column-weight 
7 and girth 2g' . Then the bit flipping algorithm can correct 
any error pattern of weight less than 710(7/2, g')/2. 

IV. Cage Graphs and Trapping Sets 

In this section, we first give necessary and sufficient con- 
ditions for a given set of variables to be a trapping set. We 
then proceed to define a class of interesting graphs known as 
cage graphs and establish a relation between cage graphs [17] 
and trapping sets. We then give an upper bound on the error 
correction capability based on the sizes of cage graphs. The 
proofs in this section are along the same lines as in Section 
III. Hence, due to space considerations, we only give a sketch 
of the proofs. 

Theorem 4: Let C be an LDPC code with 7-left regular 
Tanner graph G. Let T be a set consisting of V variable nodes 
with induced subgraph 1. Let the checks in X be partitioned 
into two disjoint subsets; O consisting of checks with odd 
degree and £ consisting of checks with even degree. Then 
T is a trapping set for bit flipping algorithm iff : (a) Every 
variable node in T has at least [7/2] neighbors in £, and (b) 
No L7/2J + 1 checks of O share a neighbor outside 1. 

Proof: We first show that the conditions stated are 
sufficient. Let X7- be the input to the bit flipping algorithm, 
with support T. The only unsatisfied constraints are in O. By 
the conditions of the theorem, we observe that no variable 
node is involved in more unsatisfied constraints than satisfied 
constraints. Hence, no variable node is flipped and by defini- 
tion xt is a fixed point implying that T is a trapping set. 

To see that the conditions are necessary, observe that for 
xt to be a trapping set, no variable node should be involved 
in more unsatisfied constraints than satisfied constraints. ■ 

Remark: Theorem 4 is a consequence of Fact 3 from [14]. 

To determine whether a given set of variables is a trapping 
set, it is necessary to not only know the induced subgraph 
but also the neighbors of the odd degree checks. However, in 
order to establish general bounds on the sizes of trapping sets 
given only the column weight and the girth, we consider only 
condition (a) of Theorem 4 which is a necessary condition. 
A set of variable nodes satisfying condition (a) is known as a 
potential trapping set. A trapping set is a potential trapping set 
that satisfies condition (b). Hence, finding bounds on the size 
of potential trapping sets gives bounds on the size of trapping 
sets. 

Definition 9: [17] A (d,g)-cage graph, G(d,g), is a d- 
regular graph with girth g having the minimum possible 
number of nodes. 

A lower bound, ni(d,g), on the number of nodes n c (d,g) in 
a (d, <?)-cage graph is given by the Moore bound. An upper 
bound n u (d,g) on n c (d,g) (see [17] and references therein) 



is given by 

n u {d,g) 



4 + 29 2 g-2 for g od( j 

| + g 2»- 2 for g even 

2 (d- l)^- 2 for g odd 

4(d — l) 3-3 for g even 



Theorem 5: Let C be an LDPC code with 7-left regular 
Tanner graph G and girth 2g' . Let T(7, 2g') denote the 
the smallest possible trapping set of C for the bit flipping 
algorithm. Then, 

\T( 1 ,2g>)\=n c (\ 7 /2],g>) 
Proof: We first find a lower bound on \T(j,2g')\ and 
then exhibit a potential trapping set of size n c (py/2] , g'). We 
begin with the following lemma. 

Lemma 3: \T(j,2g')\ > n c {\j/2] ,g'). 

Proof: Let 71 be a trapping set with \T\\ < n c (\j/2~\ ,g') 
and let G\ denote the induced subgraph of T\. We can 
construct a ([7/2] ,g")- cage graph (g" > g) with \T\\ < 
n c ([7/2] ,g') nodes by removing edges (if necessary) from 
the inverse edge-vertex of G\ which is a contradiction. ■ 
We now exhibit a potential trapping set of size n c (py/2] , g'). 
Let G ev (\j/2~\ ,g') be the edge- vertex incidence graph of 
a G(Py/2] ,g'). Note that G ev ({-y/2] ,g') is a left regular 
bipartite graph with n c ([~7/2] ,g') variable nodes of de- 
gree [7/2] and all checks have degree two. Now consider 
([7/2] ,g'), the 7 augmented graph of G ev (\j/2] ,g'). 
It can be seen that G e „ i7 ([7/2] ,g') is a potential trapping 
set. ■ 

Corollary 2: Let C be an LDPC code with column-weight 
7 and girth 2g' . Then the bit flipping algorithm cannot be 
guaranteed to correct all error patterns of weight more than or 
equal to ^([7/2] ,g'). 

V. Discussion 

We derived lower bounds and upper bounds on the guar- 
anteed error correction capability of left regular LDPC codes. 
The lower bounds we derived in this paper are weak. However, 
extremal graphs avoiding three, four and five cycles have been 
studied in great detail (see [22], [23]) and these results can 
be used to derive tighter bounds when the girth is eight, ten 
or twelve. Also, since an expansion factor of 37/4 is not 
necessary (see [7, Theorem 24]), it is possible that tighter 
lower bounds can be derived for some cases. The results can 
be extended to Gallager A and Gallager B algorithms as well. 
It should be noted that the necessary and sufficient conditions 
for a set to be trapping set for Gallager A/B algorithms are 
similar (depending on the message passing rules) to those in 
Theorem 4. Our approach can be used to derive bounds on the 
guaranteed erasure recovery capability for iterative decoding 
on the BEC by finding number of variable nodes which expand 
by a factor of 7/2. In [4], the bounds on the guaranteed erasure 
recovery capability were derived based on the size of smallest 
stopping set. Both approaches give the same bound, which 
also coincide with the bounds given by Tanner in [20] for the 
minimum distance. 
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